Rank in Wordlist | Frequency | Word |
---|---|---|
302 | 1514 | ό,τι |
2450 | 236 | 1,5 |
2896 | 200 | 2,5 |
4646 | 122 | 3,5 |
5945 | 94 | Ό,τι |
6137 | 91 | 4,5 |
6494 | 86 | 1,2 |
8343 | 65 | 1,8 |
8571 | 63 | 5,5 |
8913 | 60 | 6,5 |
Rank in Wordlist | Frequency | Word |
---|---|---|
103277 | 2 | Α(Η1Ν1 |
106960 | 2 | Εσ(ε)νόζ |
110133 | 2 | Μισθωτών(ΙΚΑ-ΕΤΑΜ |
129548 | 2 | πεντακοσίων(1 |
135629 | 1 | -(κοκορέτσι |
137046 | 1 | 1,50(APIVITA |
137115 | 1 | 1-1(4’Ογκουντζίμι |
137834 | 1 | 103,1(www |
139303 | 1 | 168040(ΦΕΚ |
140001 | 1 | 1974(9 |
Rank in Wordlist | Frequency | Word |
---|---|---|
106418 | 2 | Εγνατία)-Σιστρούνι |
106960 | 2 | Εσ(ε)νόζ |
116012 | 2 | α)Η |
128009 | 2 | ξανα)γράψαμε |
135830 | 1 | -6,24%),της |
136932 | 1 | 1)Ισοσκελισμό |
136933 | 1 | 1)Καπναποθήκη |
136934 | 1 | 1)Λειτουργία |
136935 | 1 | 1)Μία |
136936 | 1 | 1)η |
Rank in Wordlist | Frequency | Word |
---|---|---|
96763 | 2 | 10%-20 |
97731 | 2 | 35%-40 |
97785 | 2 | 4%-5 |
97923 | 2 | 5%-6 |
97961 | 2 | 50%+1 |
98185 | 2 | 7%-8 |
135830 | 1 | -6,24%),της |
136559 | 1 | 0%-0,24 |
136586 | 1 | 0,3%-0,6 |
136929 | 1 | 1%-1,20 |
Rank in Wordlist | Frequency | Word |
---|---|---|
13315 | 37 | S&P |
33549 | 11 | S&P 500 |
36010 | 10 | Β&Ε |
38640 | 9 | Q&R |
45997 | 7 | S&B |
57261 | 5 | J&P |
65963 | 4 | H&M |
79282 | 3 | S&Ρ |
98519 | 2 | AT&T |
99373 | 2 | D&G |
Rank in Wordlist | Frequency | Word |
---|---|---|
155625 | 1 | J$5 |
158391 | 1 | Micro$oft |
Rank in Wordlist | Frequency | Word |
---|---|---|
130789 | 2 | προσφυγιάς"-Ισίδωρος |
135626 | 1 | -"Δυό |
135627 | 1 | -"Η |
135628 | 1 | -"λοφίο |
140204 | 1 | 2"- |
140776 | 1 | 2009-10-06"Σήμερα |
144145 | 1 | 42%"οι |
147790 | 1 | ACS",που |
155072 | 1 | Hράκλειο"Minoan |
160796 | 1 | PROPITIATIO"(ΕΞΙΛΕΩΣΙΣ |
Rank in Wordlist | Frequency | Word |
---|---|---|
15388 | 31 | Poor's |
66304 | 4 | Sadler's Wells |
66324 | 4 | Sotheby's |
98634 | 2 | Assassin's Creed |
99218 | 2 | Champion's League |
99452 | 2 | Director's Cut |
100333 | 2 | King's College |
148562 | 1 | Aujourd'hui |
151485 | 1 | Commedia dell'Arte |
154231 | 1 | Gare de l'Est |
Rank in Wordlist | Frequency | Word |
---|---|---|
45563 | 7 | 1+1 |
56913 | 5 | 45’+1 |
65500 | 4 | 5+1 |
66746 | 4 | Α+Β |
77440 | 3 | 2+1 |
77441 | 3 | 2+2 |
92236 | 3 | ν+2 |
96951 | 2 | 15+1 |
97866 | 2 | 45+1 |
97877 | 2 | 45’+2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
157947 | 1 | MO*VIDA |
177801 | 1 | ΓΑΡΟΥΦΑ*garoufas@otenet |
195009 | 1 | ΜΕΤΑΞΑ*g |
195010 | 1 | ΜΕΤΑΞΑ*Η |
208877 | 1 | ΣΑΧΤΑΡΙΔΗ*sahtaridis@winner |
213008 | 1 | ΤΑΝΑΚΙΔΗ*Το |
213145 | 1 | ΤΟΤΛΗ*Τις |
Rank in Wordlist | Frequency | Word |
---|---|---|
9212 | 58 | Η/Υ |
9472 | 56 | H/Y |
10413 | 50 | 1/2 |
10946 | 47 | 1/3 |
13057 | 38 | Δ/νση |
17748 | 26 | 2/3 |
17751 | 26 | 3/8 |
17769 | 26 | FTSE/ASE |
18521 | 25 | ή/και |
20352 | 22 | Πλημ/κών |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots